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I. Background and Introduction 

During the early months of I969 the U. S. Office of Child Development 
planned a three vave longitudinad study designed to assess the relative 
impacts of a variety of preschool curricula. The study vas catlled 
Head Start Planned Variation (HSPV) and "began in the fall of 1969- Plans 
called for the systematic assignment of a nuraher of pre-specified curri- 
cula, each to tvo or more sites throughout the country. Selected sites 
vere to meet three criteria.- 

First > each site was to contain an on-going Head Start Program. No 
funds were allocated for serving children other than those already "being 
served hy Head Start. 

Second, each site was to draw participant children from a preschool 
population living largely within the attandance area of a school or schools 
where older children attended a Follow-Through program. By fall 19^9 > 
most Follow-Through schools had adopted one of a nmher of well-defined 
educational curricula. These programs were "being evaluated hy the Office 
of Education. Children entering selected Follow-Through schools during 
the years 1969-1972 were to "be tested at entrance and longitudinally 
followed and tested until they completed Follow-Through at the end of the 
third grade. 



Follow-Tlirough is an intensive early elementary (K-3) compensatory propram 
designed to enrich the experiences of economically poor children — particu- 
larly poor children who have had Head Start experiences. Originally inten^Ied 
to "be a national program, Follow-Through was designated as an experimental 
effort in 1963, one year after it was initiated. By I969 there vere over 110 
school districts with Follow-Through programs. 
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Third, each selected Head Start site had to adopt the curriculum 
model being used in the Follow-Through schools in its area. Aid in imple- 
menting the models was provided by consultants responsible to the original 
architects of the models. In addition, extra funds for purchasing equip- 
ment and for hiring teacher aides were provided to the selected Head Start 
classes. Overall, the cost of implementing the Planned Variations model 
was estimated to be $350,00 per child above the cost of conventional Head 
Start (see McMeekin, 1973). xnce many of the Follow-Through curricula 
were adopted from programs originally designed for pre-schools, the use of 
them in Head Start programs was appropriate. 

The design of the Planned Variations study called for children in all 
three waves to be tested at the beginning and end of their Head Start experi- 
ence. Following Head Start, the children would enter the Follow-Through 
program in their community and be evaluated at the beginning and throughout 
their Follow-Through experience. Records of the Head Start and Follow- 
Through evaluation could then be linked. The linkage would provide data 
for a longitudinal assessment of the combined pre-school and early elementairy 
experiences of the Planned Variation children. 

Testing was also planned for other Head Start childxr^n in every Planned 
Variation site. These children would attend Head Start classes without a 
designated curriculum component and would serve as a local comparison group 
in the study of the Planned Variations Head Start classes « With some ex- 
ceptions this strategy was followed for all waves of the Planned Variations 
study. The comparison children were also to be included in the Follow- 
Through evaluation. 
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Three sets of progress reports on Planned Variations were planned: first • 
at the end of che Head Start experience for each of the three waves of children; 
second, at various times during the Follow-Through experiences of the three 
waves; and third, in 1976, after the third wave of Head Start children had 
completed Follow-Through. A preliminary report on short-term effects in the 
first wave (1969-1970) was prepared in 1971 by the Stanford Research Institute 
(SRI) for the Office of Child Development. 

This paper focuses on data from the second wave of Head Start Planned 
Variations. SRI collected the data for the second wave and four reports on 
this data were prepared by the Huron Institute. One report considers the 
process and success of implementing the Plaiiaed Variation curricula in the 
various sites. 

A second report presents a detailed description and analysis of the dif- 
ferent measuring instruments used in all three waves of the Planned Variations 
study, (Walker et. al., 1973). We will refer occasionally to this report 
and will briefly summarize some of its major conclusions. The third and 
fourth reports in this series each consider the short-term effects of the 
various curricula on cognitive outcomes of children. One report (Smith, 1973) 
focuses on three questions: 

1. What are the short term effects of an Head Start experience on 
children? 

2. Are there discernable differences between the effect on children of 
a Head Start Planned Variation experience and a conventional Head 
Start experience? 

3o Do Planned Variation models differ in their effects on Head Start 
children? 
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In all instances the measured effects are narrowly defined. Specifically, 
we are concerned with three measures of cognitive achievement, one measure 
of intelligence and one measure of motor control. Although other measures 
were taken, none were considered to have adequate psycometric properties. 
Conclusions of this report are presented here ir^ considerable detail. 

The fourth report (Featherstone, 1973) systematically explores the 
possibility that different characteristics of children interact with particular 
curricula to produce different outcomes. This report uses the data from the 
first cohort to generate hypotheses and tests these hypotheses on data from 
the second cohort. It focuses on only two tests — the Stanford-Binet and 
the Pre-School Inventory (PSI) . A number of the major conclusions of this 
report are summarized here. 

This paper is divided into four additional sections. In section II we 
describe the design of the study and briefly describe the tests and measures. 
The main conclusions of the report by Walker et al. are considered in this 
section. Section III considers the three main effect questions analyzed by 
Smith and Section IV focuses on a number of the major findings of the study 
of interactions. Section V contains some brief conclusions. 
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II, Study Design and Measures 
A. Study Design 

Thirty-seven sites used Planned Vauriations curricula in 1970-71- Twelve 
ctirricula models vere represented. There vere compaurison classes at 14 of 
the 37 sites and at seven (off -site) locations not having Planned Variation 
classes. Tahle I displays this information. Columns 1, 2 and 3 of Tahle I. 
show the names of the twelve curricula models, a site code for each site, 
and the location of the site. The model and site codes were assigned in the 
Follow-Through evaluation (with the exception of the Enahler model) and 
contain no information other than identification of model and site. The 
Enatler sites are unique to the Head Start Planned Variations study. 

Column U indicates the testing level which is described later • Column 
5 of Table II-l shows the year of entry of the site into the Planned Varia- 
tions study. Fifteen of the 37 sites were also in Planned Variations in 19^9 • 

For description of the models see Appendix I. As indicated in the appendix 
the models vary on a numher of different dimensions. 
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Column 6 shows the number of Planned Variation classes in the site 
and Column 7 shovs the number of tested classes. Column 8 shovs the 
niuaber of comparison classes all of vhich vere tested. Blanks in 
Column 6 indicate that the site vas an "off-site comparison" site. 
Note that the off-site comparisons are paired vith Planned Variation 
sites and are given the same site code as a Planned Variation site. 
Blanks in Column 8 indicate that there were no comparison classrooms 
at that site. 

There are a number of things to note about the design. First, nine of 
the 12 models have at least 2 sites and 5 of the 9 have 3 sites. Although 
it would have been preferable to have greater replication, the very fact 
that some systematic replication did occur strengthens inferences about 
model effects. 

Second, the study is not confined to one region of the country. Although 
there is some regionality associated vith some of the various models, e.g., 
the Gordon model sites are all in the South; by and lax'ge, each model is being 
tried at a vari;:^.-"^, >f locations. 

Thii/i, a number of sites do not have comparison classes in their same 
location. This tends to complicate analysis of the data. 

B. Measures Used in the t>/aluation 

The level of testing indicated in Column 1* of Table I refers to the 
evaluation activities carried out in the sites. Due'primarily to economic 
constraints, not all sites were tested on all measures. There were three 
levels of evaluation activities — I, II, III. Tatle II describes the 
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activities included at each of the three levels ♦ Level I is the most 

primitive* Ei^ht Planned Variation sites fall in this category* No 

comparison sites are in Level !• No data gathering at this level 

involved the children* Teachers completed demographic information forms 

and filled out the California Social Competency Scale for each child in 

their classrooms in "both the Fall and Spring* Also, teachers and teacher 

aides responded to a questionnaire requesting information atout their ovn 

"backgroixnds , teaching experiences and attitudes* In addition, Sponsor 

and Head Start Directors rated the level of implementation in the class- 
ic 

rooms in every site* 

All data collected at Level I was also collected at Level II* In 
addition, three other sets of data were gathered at Level II* Classroom 
observations vere made in toth the Fall and Spring "by observers using the 
SRI Classroom Observation Instrument* All children in tested classrooms 
vere administered the Basic Test Battery in both the Fall and Spring* Four 
tests were included in the Basic Battery — the Caldwell Preschool Inventory, 
IIYU Booklet 3D, (a 17 item test of achievement pre-math, pre-science and 
linguistic concepts), NYU Booklet 1*A, (an l8 item test' of achievement — 
letters, numerals, prepositions) and the Motor Inhibition Test* Children of 
Black and Spanish origin, whose parents agreed, were administered a test 
assessing their knowledge of their ethnic heritage* Eleven Planned Variation 
Bites were classified as Level II* Of the eleven, six had comparison classes 
also tested at Level II* 



See appendix for a short description of the measures used in the study* 
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TABLE II 



Three Levels of Planned Variation Evaluation Activities 



Level of Hvaluation 
level 1 

Ij.TeJ'cher completed classroon information 
ferns for child demographic data 

2) Teacher completed California Social 
Coarpetency Scale one for each 
child 

5) Sponsor ratings of Level of Jmplcnent- 
at ion 

4) HcDd Start Directors ratings of Level 
of Inplementation 

$) Teacher and Teacher Aide survey 



Data Collection Period 
Ftjil Spring 



X 
X 
X 



X 
X 



X 
X 
X 

• X 



Level II ..(includes All activities in Level I and the following) 



6) Classroom observations 

7) Basic Child Test Batter. . 
a* Preschool Inventory 

b. KYU Book 3D 

c. KYU Bi>ok 4A 

d* Motor Inhibition Test 

S) Child completed Ethnic Heritage Test 



X 
X 



X 
X 



level in (includes all activities in Levels I and II and the following) 

X X 



9) Stanford-Dinct testing on random one- 
half of children in all tested classes 



10) 8-Blocks Sort Task — given to other 
random one*hnlf of children in all 
tested classes 

11) Parent Inteivicws administered to 
parents of children taking the S-Block 
Sort T;<sk 

J?) Initnsive Case Studios (U. of .Marybnd) 



X 
X 
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Level III cites had all data eollcetion aetivities earried out in 
Level I and Level II sites and» in addition^ four other activities. 
Randomly chosen, one-half of the children in each tested Level III class- 
room vere administered the Stanford-Binet in "both the Fall and Spring. The . 
same children received the test "both times. The children in the other half 
of the class ~ along with one of their parents or guardians — were adminis- 
tered the 8-Block Sort Task in the Spring. Additionally, the parents or 
guardians of the children in this group completed a parent questionnaire 
which asked about attitudes toward Head Start, their child, and the Planned 
Variation model used in their child •s classroom. Eighteen Planned Variation, 
ten on-site comparison, and five off-site comparison sites were assigned to 
Level III. 

In an extensive review of the mc-asures and data collection activities 
of the three cohorts of HSPV Walker et el. (1973) reach four general conclusions. 
These conclusions are as appropriate to the 1970-71 data as they are to the entire 
study. In hrief they are: 

l). A large numher of measures exist which purport to assess the cognitive 
characteristics of preschool yoxmgsters. Many of these measures are apparently 
well constructed and show high internal and test-retest reliability. By and large. 



however, little data exist concerning the long-term predictive validity of the 
measures. In general, dara on predictive validity are limited to correlations 
between the tests given at preschool tirae and similiar tests given in the early 
elementary years. These correlations arc generally moderate falling in the O.liO - 
0.50 range at the outside. One except ion\ to this is the Stanfopd^Binet which shovs 
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a fairly stable relationship to itself tron the preschool years to late adolcsence 
— the correlation "between the Binet at age six and the Binet at age l8 is in 
the range of O.6O -0.70 (see Jencks et al., 1972) • Moreover the Binet seems to 
"be fairly strongly related to eventual years of schooling completed (we estimate 
that r=0.1»2 between IQ at age 5 and eventual years of schooling). The relationship 
to such future outcomes as occupational status and ipcome is considerably less. 
An important drawback to these data, however, is that they represent relationships 
estimated in the absence of effects of preschools. It is entirely possible, for 
example, that preschool treatments which affect the IQ level of young children 
also change the predictive relationship between early and late IQ. Even in the 
absence of data answering these questions, however, the existing cognitive measures 
are psychometric ally far stronger than the measures of "non-cognitive" areas. 

2) . Measures in the ^'non-cognitive" areas should be uniformly classified as 
experimental. With only a few exceptions little data exist as to their relia- 
bility or validity. Moreover, they are characterized by a lack of a clear relation 
to theory and by severe problems in administration. (For an extensive review of 
"non-cognitive" measures for preschool age children see Walker, in press). In 
every '.nstance we classify the non-cognitive measures used in HSPV as inappropriate 
for use as sumraative evaluation instruments. This is particularily unfortunate 

in this stucy given the diversity of goa3.s of the various sponsors in HSPV and 
represents a severe drawback in the evaluations. 

3) . By and large comprehensive norms and assessments of the psychometric 
characteristics of both cognitive and non-cognitive measures do not exist for 
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the population of children served by Head Start . We attempted to alleviate 
this prohlem for the measures used in KSPV by extensively describing the 

data and we recomnend that future researchers follow a similiar strategy 

in future evaluations. 

U]. Over all we found the data collection procedures followed by SRI 

to be highly competent and productive of generally very hicX^ quality data. 

However > although the data collection procedures were exemplary, the design 

of the study raised some sticky problems for analyses. These are considered 

in the next section where findings of the main effects study (Smith, 1973) 

are sximmarized. 
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III. The Main Effects Study 

This section contains tvo parts. First ve consider some of the general 
methodological implications of the design problems mentioned in Section II. 
We then 'samnarize conclusions to the three main effect questions. 
' A, Hethodolo»:;ical lTiiT)lications and Stratefi;y for Analysis 

/ . 
The strategy for analysis v .s dictated in large part hy constraints . 

on the study. First, as we noted earlier, ve focused. principally on the 
analysis of cognitive growth.. To do otherwise would he to seriously over- 
play the existing data. In doing this we recognize that we are not even 
attempting to capture the richness of a preschool experience or the largest 
part of the differences ainong preschools* 

Second, also as described earlier. Planned Variation sites were not 
randomly assigned to models. Rather, the sites were selected on two criteria 
xmrelated to the requirements for an adequate experimental design, and then 
given the opportunity to accept or reject the assigned curriculum. Moreover, 
the local community had control over the specification of which classes within 
a site were to employ the Planned Variation (PV) curriculm. Since the 
selection of comparison classes within the PV sites occurred after selection 
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of the Planned Variation classes, the treatment (PV) and comparison (NPV) 
classes cannot he assumed to he random samples drawn from the same popu- 
lation. Thus, randomization did not occur at either of the two critical 
design points — at the level of assignment of curriciila to sites or at 
the level of assignment of treatment and comparison groups within sites. 

The lack of a true experimental design puts the analysis of the data 
into a never-never land* Had we random assignment of curricula (treatments) 
to sites, then a comparison of treatments would yield us unhiased estimates. 
If we had random assignment of classes to PV and ?IPV groups within sites, 
then^a comparison of the two sets of classes would yield unhiased estimates. 

If we had two random samples of children from the same popiilation one 

going to Head Start and one not, then estimates of the general effects of a 
^Head Start experience would he unhiased* But we have no random assignment, 
so all estimates are hiased in some unknown fashion. Estimation of effects 
thus hecomes an art instead of a science • There are numerous statistical 
techniques to help reduce hias (matching, covariance, blocking, crossed 
designs and standardization techiques). Each -may he helpful depending on 
the adequacy of the structural model we are trying to fit. .That is where the 
essential prohlem lies, for we have no a priori way of determining which is the 
hest analytic model. Given this state of affairs, we follow Tukey's advice: 
"As in the famous discussion hetween Student and Fisher and the interjections 
hy Sir Harold Jeffreys, it may not he a had thing to use all the allowed 
principles of witchcraft and not Just one set." (Tukey, 1973, p. 112). 
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We did not use all the principles of methodolosical witchcraft, tut 
we did use a numbor. In particiaar, our strategies for removing Mas in the 
data depend on (l) our choice of a statistical model; (2) our choice of variables; 
and (3) our assessment of the accuracy with which the data are measured. Different 
decisions in these areas of judgment lead to the use of different statistical 
approaches which in turn leads t'- a variety of estimates of "effects"- In a 
sense, different estimates from different analyses gives us a sense of confidence 
limits for the reported effects. Such an approach will generally inspire caution 
in interpretation, for most of the effects found in this study are snail and some- 
what sensitive to differences in the analytic strategies • On the other hand, large 
effects which turn out to be robust — insensitive to variations in analysis 
methods — presumably should inspire confidence. 
B. Suirjaary of Findings in I!ain Effects Study 

Three main questions were addressed in this study: 

— What are the overall short term effects of a Head Start 
experience on children? 

Are there discernable differences between the effects on 
children of a Head Start Planned Variation experience and 
a conventional Head Start experience? 

Do Planned Variation models differ in their effects on Head 
Start children? 
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Five measured outcomes vere used to assess each question. The PSI is a 
general standardized achievement test for pre-school children. The NYU 
Booklet 3D and NYU Booklet hA are tests of specific achievement areas. The 
Stanford-Binet is a veil known test of general "intelligence". The Motor 
Inhibition test assesses a child •s ability to control motor behavior. 

1. Overall Effects of the Head Start Experience 

' With regard to the question of overall short term effects of Head 
Start we reached four conclusions: 

Q-* The Head Start experience substantially increased children's 
test scores on all five outcome measures . On four of the five 
outcoT^e measures children's scores were estimated to increase 
"natiirally" over the seven or eight months of the Head Start 
program. Thus, even had the children not been exposed to 
Head Start, their scores would have risen. For two of these 
measures (PSI and Booklet 3D) the Head Start experience was 
estimated to double the "natural" rate of growth. For two 
other measures (Booklet 1*A and the Motor Inhibition tests) the 
Head Start experience was estimated to better than triple the 
"natural" rate of growth. Increments attributable to Head 
Start ranged from 0.26 standard deviations (for the Motor 
Inhibition test) to 0.82 standard deviations (for the Booklet 
hPi test). On the fifth measure, the Stanford--Binet , our 
estimates indicate that the scores of children in this sample 
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TABLE III 



Gains for the total analysis sample on 5 outcome 
measures. Observed gains are partitioned into two 
components — gains attributable to maturation and 
gains attributable to an Head Start experience. 
•All gains are expressed in individual level pre- 
test standard deviations. 



Test 


Observed 

gain 
(total) 


Attributable to 
maturation 
(estimated) 


Attributable to 
Head Start 
(estimated) 


PSI 


0.942 


0.496 


0.446 


Book 3D 


0.727 


0.363 


0.364 


Book 4A 


1.151 


0.333 


0.818 


Motor 
Inhibition 


0.36 


0.10 


0.26 


Stanford- 
Binet 


0.348 


-0.296 


0.644 
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TABLE IV 



Overall Average Chan g es in Moan Tost Scores for Children 

1. . [ J.^ /U- /x r^mai Analysis bample 

2,235 children are represented in the table. Each cell con- 
tains the rr-eon gain and the nurrher of children in the group, 
(Blank cells indicate insufficient N to estimate mean), 
• • • 

Groups 



8 

e> 


nth ni city 


VTi or 
Pre- 
school 
Exper. 


Entering 
Grade 


PSI 

Gains 


Book 

3D 
Gains 


. Book 
4A 

Gains 


MI 

Gains 


Stanford 
Binet 
Gains 


X 


Mexican- 
American 

Mexican- 
American 


No 


El 1 


Xc=l2.5 

9.7 

15^ 


2.6 

irr 
1.7 


4.0 

TTT 
XXx 

2.9 

101 


0.389 






No 


£k 2 
• 


62 
0.463 
TT 


7.5 
40 




White 


No 


.El 3 


10.4 

157 
12.6 
426 


2.5 

2.7 


' 5.8 

JL!>o 
3.7 
4 JLo 


0.284 


7.7 


4 


White 


No . 


Ek 4 


102 
0.4 34 


65 
5.6 










JL68 


147 


•5 


Black . 


Ko 




XJL • J 
i52 
12. 8 
"STl* 


2. 0 


6.0 

250 

4C . 0 

556 


0.380 


.2,6 




Blnck 


No 


Ek 6 


250 . 
2.1 


lib 
0 . 402 


T27 
6.9 








• 


557 




164 


f 


Kcx'i can- 
Amor J can 

Mexican- 
AmCirican 


Yes 

YCB 


El 7 
Ek 8 






• 




• 




niix Vis 


Yes 


. El 9 


7.1 
7.9 


1.9 : 
1.8 : 


5.4 


-o.ooi* 


3.6 




White 


Yes 


Ek 10 


2.7 


'4(5" 
0.460 


31 
2.2 * 








• 




" 32 




ir 


Black 


Yes 


■ El 11 


10.2 


2.5 ' 


5.1 


0.210 


-0.3 * 


i; 


Black 


Yes 


Bk 12 


■'111 
115 


111 

2,2 


ill 

2.6 


49"- 
0.-347^ 


48 
^ r3.4 










114 


.113 
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TotAl 




• 


5reii.4 


2.4 

r;9"93 


3. 'a 


0.36 


4.7 










n«2,00 3 


I79"81 


765 


SIT 






• 




2.9 


4.2 


0.63 


11.1 








pre 


••12a 


3.3 


3.3 


0.54 


13.5 




• 1 






3.1 


4.7 


0.57 


13.2 



*JnOicatcs change io ngL ctatiotically oignificant beyond the .05 )cvel 



ERIC 



00 020 



xy I N ..II 

would have "naturally" decreased ty atout 0.30 standard 
deviations had they not attended Head Start. The Head 
Start experience arrested this apparent decrease and further 
increased Head Start partici-pants ' Stanford*-Binet scores hy 
roup,hly 0.35 standard deviations . 

Children vho had a prior preschool experience gained less over- 
all ("natural" + Head Start related grovrth) than children for 
whom 1970-71 Head Start was their first year of preschool. This 
effect held for all outcome measures and for most of the subgroups 
(see Tahle IV). Featherstone (1973) reached the same conclusions 
for the PSI and the Stanford-Binet using a somewhat different 
analysis strategy. If, however, we allocate the total 
gains for the two groups of children "between "natural growth" 
and the Head Start experience, we find that the effects attri- 
tutahle to Head Start are roughly equal for children with and 
without prior preschool experience (see Tahle V). This indi- 
cates that the expected "natural growth" for children with 
prior preschool experience is less than for children without 
prior preschool. The prior preschool experience appeared to 
reduce differences in test scores between children of different 
ages. In other words, a common preschool experience partially 
overcomes the effect of age differences ajnong children on the 
five outcome measures. Some support for this notion comes from 
the fact that variances on four of the five outcome measures are 
somewhat smaller at post-test time than at pre-test time. This 
indicates that differences among children are less at the end 
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TABLE V 



1? 



.04 C 

0 *0 



Overall Gain for children in the Planned Variation 
Study in 1970-71. "Gains" are corr.outod bv su btracting 
an Kxpectod cost scoi-q" r rcr. r.n ' Co'scrvGo" i>o "su Score?" 
(lilanh ceils ir.aicauc insuirxcicnt nuirJoors oi chiidrcrT' 
to estimate mean "gains,") • . 



8 


jC 
V} 


CU (0 


^ 10 j 


PSI 
"Gain," 


nooti 30 
"Gain" 


Book <)A 


Motor 

Inhib, 


o tunioru** 
Dinet 


-1 


M-A 


No 


El 


Gaino 
6.7 


1.8 


2.6 


0.14* 












n c 111 


'■ 111 


111 


63 






M-A 


No 


Ek 


0.^0* 


0.2* • 


1.9 


0.19 


9.3 










^106 


r 106 


io;i 


22 


• 40 


3 


Khte. 


No 


PI 


4.7 


1.1 


4.7 


0.1.; 


0.0 










157 


158v 




XQ«> 


65 




Whte. 


No 


Ek 


4.0 


1.1 


1.9 ■ 


0.28 


7.0 










*i A V 




418 


169 


147 




BIK. 


No 


El . 


8.4 


2.4 


.4', 8 • 


0.40 


9.6- 










'252 






1)6 


'l29 


6 


BJlk. 


No 


Ek 


6.4 

561 

V A 


1.0 


1,9 


0,38 


10.4 














• 136 


164 


7 


M-A 


Yes 


El . 


— — — 










o 
o 


W •A 




Ek 




• 1 






«mmm 




Whtc. 


Yes 


El 


7.1 

69 


0.4* 

69- 


3.4 


0.05* 


12.2 










69 


' 47 


31 


•0 


Khte. 


Ves 


Ek 


8.0 


2.4 


2.7 


0.47 


14.5 










• 75 


75 • 


74 


' 33 


27 


'1 


BDc. 


Yes 


El 


7.3 


' 1.7 


4.8 


0.03* 


8.4 










111. 


' 111 


' 50 


*~7 48 




Blk. 


Yes 


Ek • 


-0.7* 


-0.2* 


1.1 


. 

0 . 47 


-2.0* 










115 •. 


114- 


113 


i6 


29 








. TOTAL 


5.4 


1.2 


2.7 


0.26 


8.7 








S^gai 


^2003 

„-9.2 


1993 

2.8 


1901 

4.5 


765 

0.56 


686 

12.2 



* Indicates gain not stntifiticnlly cignificuntly greater 
. t-hftn 7.01-0 beyond tho .05 level. ^ . f 
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of the preschool program than they are at the beginning 
of the program ♦ Preschools may have a "fan-close" rather 
tluin a "fan-spread" effect on children. 
Children vho voold enter first grade (El) directly from 
Head Start tend to gain more than children vho would enter 
kindergarten (Ek) directly from Head Start on the Booklet 
1*A, Booklet 3D, PSI and Stanford-Binet tests.. On the 
motor Inhibition test the Ek children gained more. (The 
average age of El children when they entered Head Start 
was 65 months — Ek children were roughly one year younger.) 
The greater gain for El children was most pronounced for 
the Booklet 1*A test and least for the Stai;ford-Binet (see 
Table IV). When the gains attrihutahle to Head Start were 
examined, the effect appears to strengthen, though they are 
Btill small for the Stanford-Binet (see Tahle V). These 
effects are prohahly due to a combination of two things. 
First, the larger gains attributable to Head Start for El 
children on the cognitive measures and particularly the 
Booklet hA test (a measure of letters, numerals, and shape 
names) may be due to older children's advanced academic 
readiness. Second, there may be a greater interest by Head 
Start teachers in El sites in preparing children for reading 
Olid arithmetic. 



A niimber of cites (particularly in the Southeastern areas) do not* have 
kindorcartens • In these sites Head Start children generally go directly 
from Head Start into first grade. 
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There seem to be no consistent differences among Mexican 
American, black arid white children in their Head Start 
gains on the five outccanes. 
A brief mention of the methodological procedures used to arrive at these 
conclusions is in order. Since ve did not have a group of "control" children * 
(children who did not have the benefit of an Head Start experience) our pro- 
cedures for separating "matiiratijn" from "Head Start" gains relied on natural 
variations in prescores for children of different ages. The reader is warned 
to treat these data as rough estimates and to evaluate for himself the assumptions 
of the procedures. I should note, however, that analysis of the third wave of data 

(1971-72), when there was a real "control" group indicate that the estiinates are 

IT— - — 

generally accurate. * 



Since we did not have the appropriate control group in the study — that is, 
a group of children who did not attend preschool — ve had to make our estimates 
of "natural gain" and "gain due to Head Start", by using age variations in the 
chxidren in the study. Briefly, our procedui^e was as follows: First s we 
divided the children into the twelve groups represented in the tables. Second , 
we took the pre-tcst scores of these children on all of the five variables 
ve were interested in and ran 60 separate regression equations (l for each pre- 
test variable for each of the 12 groups). 'The pre-test scores were the dependent 
variables in the r3gression analyses. The independent variables for each analysis 
were age, sex, fam ly income, household size, mothers' education and appropriate 
dtJumay variables to control for missing data. By using the coefficients for these 
equations and the original data, we came up with an "expected" pre-score for each 
child. Within each of the 12 sub-groups, the mean of the expected pre-scores 
equals the mean of the c.ctual pre-scores • 

Our analysis requied two more steps. Remember that the procedure controls for 
the effect of age v/ith the three stratifying variables and their interactions as 
well as for the variables in the regression equations (sex and family background). 
Thuc, with respect to the relationship between age and the PSI test score , ve can 
argue .hat the coefficient for age within any given equation reflects the rate of 
gro^rth for the children within that group — other reasonable things being equal. 
Or in other vords, other things being equal, ve can argue that the main difference 
between the score of a child entering the program when he is U8 months old and his 
score if he had entered the program when he was 55 months old, is reflected in the 
coefficient for age for his particular group. Thus, given this argument, ve can 
c«timHte an cxDoctcd post-test score for the child a score that should approxir.ate 
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2) Differences bctvccn Effects of PV ond Conventional Head Start Pronram 

The second major question regards overall differences in effects 
for Planned Variation and conventional Head Start programs. Although 
we considered the question in some detail, we argue that the question 
has very little importance. For while we might expect there to be 
differences among PV programs in their effects on the five outcome measures, 
we have little reason to suspect that there should "be systematic differneces 
between an overall PV effect and an overall effect of conventional Head Start 
programs. This question, like many total progvaa impact questions, totally 
obscures systematic differences among treatments. 

The sole rationale for studying the question is to .determine whether the 
extra funds allocated to PV Head Start programs had a consistent effect on 
the measured outcomes. Our conclusion supports the findings of a large number 
of recent research efforts which have failed to detect any systematic relation- 
ship of gross expenditure^^ to variations in outcomes. We conclude there are no 
differ ences in effects between the t>v proirroms (taken together) and the Compariscr 
Head Start prop.rams on any of the five outcome measures. 



(con't) the score he would have recicved on a post-test. Assume, for example, 
that a child's estimated pre-score is hO. Also assume that the preschool program 
is 8 months xong and that the coefficient for age is 0.5 — a gain of 1/2 point/ 
nonth. Then we can estimate what the hilds' score would be if. the programs had 
no effect. It would be UO + ,5 (8 months) [Uk]^ expected score. Then if post- 
tert score was ho — the "gain" due to Head Start would be li points. 

We follo^/ed this procedure: for each child in each of the groups on all 
five tests, we estimated an expected " post-test" score. We then subtracted this 
score from his actual post-test score and computed group means. The group means 
are shown in Table 5. I should note two problems with this procedure. First, 
there may be an effect on the post-test score created by the fact that the child 
has taken a pro-test. This is uncontrolled for in this analysis. My guess, h'^v- 
ever, is that this effect is minimal compared to the effect of the preschool 
socialization exijerinece on the test-wiseness of the child. Second, the validity 
of the approach clepends upon the quality of the controls — used both in the strati- 
fication or in the regressions. We have to be able to argue that the coefficients 
for eige are unbiased, 
n 

For details on the amdy^is strategies used to reach those conclusions and the 
conclusions relating to modol-to-model differences see Smith, 1973. As indicated in 
Q the text a variety of analytic strategics were used to rea<jh each set of conclusici.s* 
• ipl'Jiese included Analysis of Covarianco, matcliing teohniqucsxthe standardization tec:.- 
^aa^iQUo used in the assessment of the overall Head Staia effects. 0 0025 



3) Differential effects of the Various Planned Variation Models 

The third question addresses differences anonc PV progrwns in their 
effects on Head Start children* 



model effectiveness* The eleven PV riodels are the rows of the table while 
the five outcome measures are each represented hy a column of the tahle* 



The cell entries indicate ef fectr oness relative to the other PV models end 
to appropriate conventional Head Start classrooms* Five general conclusions 
may he reached after inspection of this tahle. 



We began the study with the expectation that there will be few 
strong differences among the models in effectiveness as 
assessed by our five outcome measures. By and large this ex- 
pectation was realized* Table VI indicates that for each of 
the outcome measures ve have classified the majority of the 
models as having average effectiveness. Moreover, no model 
stands out as either more or less effective than the others on 
more than two of the five outcomes. In the crudest terms the> ^e 
ore no overall winners or losers , 
b , A second and mo r e tentative expectation vas that models which 
emphasized academic drill combined with systematic reinforcement 
y ould be more effective than other models on the four corrnitive 
outcome measures. This expectation vas realized only for one of 
the four cognitive ineasures . Only for the Booklet measure — 
a test assessing knowledge of letters, numerals ^ and shape names — 
is there evidence of greater effectiveness for the models emphasizing: 



Table VI 



summarizes our findings regarding differential 
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TABLE VI 



Summary of Planned Variation Model Effectiveness on Five 

Outcome Measures 



Zero (0) indicates model is of average effectiveness on' 
outcome measure. 

Minus {-) indicates model may be of below average effec- 
tiveness. 

Plus {+) indicates model may be of above average effec- 
tiveness . 

Double plus (++) indicates model is probably highly 
effective. 



Model 


Book 
3D 


Book 
4A 


PSI 


Stanford 
Binet 


Motor 
Inhibition 


Par West 
Laboratory 


0 


0 


0 


0 


0 


Arizona 


0 


0 


0 


0 


0 


Bank St. 


0 


0 


0 


- 


+ 


Univ. of 

Oregon 


0 


+ 


0 


0 


0 


Univ. of 

Kansas 


0 


++ . 


0 


0 


+ 


High 
Scope 


+ 




0 


++ 


0 


Univ. of 
Florida 




0 


0 


0 


0 


EDC 


0 


0 


0 


0 


0 


Univ. of 

Pittsburah 


0 


+ 


0 


+ 


mm 


REC 






0 


+ 


0 


Enablcrs 


0 




0 




+ 
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drill and reinforcement • The uro ^rsity of Kansas model 
is the clearest example of this finding. We found it to 
be clearly superior to all of the other models and to the 
Comparison classes in its effectiveness in raising Booklet 
1»A test scores. The two other models which emphasize aca- 
deaic drill (University of Oregon and University of Pitts- 
burgh) both appear to he above average in their impact on 
this test. 

On the other cognitive tests there is no indication of 
special effectiveness of these three models. Only the Univer- 
sity of Pittsburgh model on the Stanford-Binet shows an other 
than average effect. These findings are at some variance with 
the finding of other researchers in the pre-school area (see 
Bissell, 1970; and White, S., et al. , 1972). These researchers 
indicated that there may be a general positive effect of 
structured academic emphasis and drill on cognitive tests. Our 
data, however, indicate that the effect is specific rather than 
general. In particxilar it appears as if this approach may be more 
effective for imparting information that is easily taught through 
syster.atic drill while it is only of average effectiveness in 
other cognitive areas. Of the four cognitive tests the Booklet 
1*A test most clearly assesses specific skills. The other tests j 
particularly the PSI and the Stanf ord-Binet , assess general in- 
formation and cognitive functioning. 
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• One model clearly stands out as more effective than the others ^ 
in raising Stanford->Binet test scores. The High Scope PV model 
appears to increase Stanford-Binet scores "by an estimated twelve 
to fifteen points, roughly 0.9 individual level standard deviations . 
The average effect of other PV and Comparison models is on the order 
of three to four points or roughly 0.3 standard deviations. The 
estimates of these effects are "based on analysis of two very different 
sites. They also replicate the general findings from the 1969-70 
cohort (Featherstone, 1973)- The effect of the High Scope model is 
particularly strong in one Southern rural site where the measured 
average gain is slightly over thirty points. Although we can probably 
attribute some of the measured gain to tester and regression effects, 
the "corrected*' gain in this site is still on the order of a very 
substantial twenty points. Preliminary analyses of the itera profiles 
of children in the High Scope sites indicates that the gains apply over 
the entire range of appropriate Stanford-Binfet items (see Butler ,1973) . 

The particular effectiveness of the High Scope model on the 
Stanford-Binet does not appear to generalize to the other outcome 
measures used here. For three of the four remaining tests the 
moo.el appears to be of only average effectiveness. On the fourth 
teBt, Booklet 3D, there is some indication that the High Scope 
model may be of above average effectiveness but no firm conclusion 
may be reached from the data. 
/ 
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d. Two of the eleven models (University of Pittsburgh and REC) 
account for hO% of the 15 cells in Tahle VI where there is 

an indication that a model has other than average effectiveness 
on an outcome measure. Pittsburgh appears ahove average on the 
Booklet hA and Stanford-Binet tests and helow average on the 
Motor .Inhibition test* REC appears below average on the Booklet 
3D and Booklet hk tests and above average on the Stanf ord-Binet . 
No other model is rated as other than average on more than two 
of the measures. Three things are common to REC and Pittsburgh. 
Each uses some form of programmed instruction, each was a first 
year model in 1970-71 > and each has only one site in this study. 
Although the first two common elements may be important, our 
inclination is to view the fact that each model has only one site 
as the principal reason that these models have more than their 
share of "other than average" effects. This may be due to dif- 
ferential effectiveness of models in different sites or to un- 
controlled biases in our data. ^sT:atever the reason, our incli- 
nation is to be very skeptical about attributing clear effects to 
any model with only one site% 

e. All models are rated as showing average effectiveness on the PSI 
test. Me had not expected this restilt since our preliminary 
analyses of the PSI indicated that it is probably our most relifible 
measure. In retrospect, however, we suspect that the reason for 
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the lack of clear differences among models on the PSI 
is due to the nature of the test itself. The PSI was 
developed as a general test to assess the overall impact 
of preschools on children. As such it attemtps to measure 
a wide range of skills protatly rendering it relatively 
insensitive to particular differences among curricula. Thus 
it is protatly more appropriate to the tasks of assessing the 
overall average impact of preschools and of individual differences 
among children. 

This brief summary highlights the central findings of the main effects study. Th 
conclusions regarding the effects of particuliar sponsors held up in a numter of dif- 
ferent types of analysis. In the next section we consider the question of whether th 
effects of the various preschool curricula are sensitive to differences in chi3.d 
characteristics. 

IV Child Characteristic ty Model Interactions 

Two general strategies were followed for the analysis of child characteristic 
ty model interactions in the 1970-71 HSPV data. First,' Saaftth, 1973 explored a 
variety of potential interactions in the context of the main effects study to 
determine if general conclusions atout main effects should te tempered "by the 
understanding that the main effects are only averages ajid that the programs work 
somewhat differently for different kinds of people. In keeping with the "con- 
servative" nature of this study the exploration was limited to instaiices where 
clear main effects were present. The conclusion from these analyses is clear. 
While it is certainly true that the models which showed large effects were not 
equally successful for all examined typos of children, major disordinal inter- 
actions did not appear. Thus, a model that was effective on-the-average for a 
certain test was not particulari]y "effective" for one type of child and mrti- 
ERIC cularily "ineffective" for another, (^^^^^^s especially true for the two very j 
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prograw appeaured to produce gains on the Stanford-Binet which exceeded those 
of almost all of the other programs for all types of children. A similiar 
conclusion applies to the effectiveness of the University of Kansas with the 
Book 1»A test. . . - . 

The second type of exploration was caurried out hy Featherstone , 1973. Her 
analyses focused directly on the question of child-chau:acteristic hy model inter- 
actions as a research issue rather than as a mechanism for interpreting main 
effects. The focus in these analyses was limited to the PSI and the Stanford- 
Binet. Three general sets of findings emerged from her analyses. 

1) . Although analyses of interactions in hoth the I969-7O and in the 1970- 
71 suggest that there are a lot of things going on in the data there was little 
comparability between conclusions that could "be reached for the two years. More- 
over, it was generally the rule rather than the exception that the two outcome 
measures reacted differently vrith regard to the interactions. These findings may 
"be due to a variety of factors including the relative weakness of the I969-TO 
data, non-comp£u:ahility between the samples in the two years, an advanced state 
of .implementation of the models in the second cohort, or a general lack of real 
consistency in the effects. For an important exception to this generalization, 
note the third finding below. 

2) . A focus on the second year data combined with an a priori grouping of the 
models to add power to the analysis, however, provided a nvmiber of very provocative 
findings. In these analyses two "more directive" or "adult controlled" models 
(University of Oregon amd University of Kansas) were contrasted with three "less 
directed" and "otudent initiated" models (EDC, Bank Street and Far West Laboratories 
A number of findings emerged from these comparisions. The "more directive" models 
tend to favor: a) children without preschool experience; b) children who initially 
score on the low end of the distribution (PSI only); c) boys with preschool expericn 
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in contrast to girls "irith preschool experience; and d) younger children. The "less 
™" 0 0 0 2 



directed" models favor a) children with preschool experience (PSI only); and 
t) initially high scoring children. Interactions of SES and ethnicity with model 
type are generally negligible. Vfhile all these findings are not reproduced in "both 
cohorts, they appear to te somewhat coherent and deserve close attention in 
future studies. 

3). One set of findings appeared to replicate in the two cohorts. Associated 
with the administration of the Stanford-Binet was a "behavioral scoring system 
developed "by Hertzig and Birch (Hertzig et al. , I968). Two measures adapted from 
this scoring system and assessed at pre-test time were used to categorize children 
One measure, "passivity" was indicated "by a child remaining silent and passive if 
he did not know the answer to an item. The second measure, "competence" was 
indicated "by a child attempting to answer items he was unfamiliar wi*th and 
elaborating on correct answers. Children were classified as high or low on these 
two measures. In "both years two general tendencies were discovered: "more 
competant" and "less passive" children were more successful in the "less directive 
models and "less competent" and "more passive" children were more successful in 
in the "more directive" models. The findings were clearest for the Stanford-Binet 
thought the PSI data tended to fall in the same pattern. Of crucial importance 
in interpreting these findings is the fact that the "passive" and "competence" 
measures vrere the only directly o"bserved "beliavioral independent variables used in 
the interaction study* It is very possible that future studies of child- 
characteristic by curriculum model interactions will waste their time if they don' 
focus on behavioral rather than "sociological" variables.- 
V, Conclusions 

A number of the findings of this study are not surprising. Numerous studies 
have shown that Head Start has a clear and strong short term effect on cognitive 
outcones (see e.g., Datta, 1972; or Sterns, 1972). And many more studies of 
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..OCX e...=ts indicate t.at it is »liV.oly t.at a„. one cu.ri=ul- win 
3.0. a clea. a.va„.ase ova. ot.e. curricula in the production o. cognitive 
.rovt. (see .enc.= . i^-. 1973). However, .or t.e e..ect. ,.ich .o persist 
our anal.se. there is a see.in^. clear relation .et.een the .oals o. a 

A nh-'ld outcomes "produced" by the curriculum. The 
preschool curricula-r. and the chad outcomes p 

^ ^■L. 4.„„+ ,,-f> iptters and numerals, 

3ucce.s Of the academically oriented curricula on the test of 

Hi.h scopes success on the Stanf ord-Binet . and the apparent success of the 

n .ve" and generally less ready children 
..directive models" with younger, more "passive and gener 

are the tbree principle examples . 

.sth the aT^mrent insensitivity. of the Preschool 
These findings, along with the apparemi 

„ to ai^erent curricula, see. to indicate that evaluations o. preschool 
school, c^ricula should attend .ore to the soals o. the curricula than .o 

• the tests. As indicated in the discussion 

the psychometric characteristics of the tests. 

o. no„-co.nitive measures, o. course, this is easier said than done. At the 
laast. however, .e need to have a diversity o. co.itivel. oriented measures n 
assessments. .or e..ple. »e had used onl. the .ost .VeUahle and 

„idel. used test in this stud, (the PSI) . we »ould have had ver. little o. 
interest to report. 
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APPENDIX I 

The Curriculum Approaches (Models) 

This appendix "briefly describes the tvelve models used in the 

Planned Variation study in 1970-71 • As ve noted earlier, each of the 

approaches, vith the exception of the Enahler model, has heen developed 

and is sponsored ty some group of people in a university or private 

> 

corporation. The descriptions are intended to reflect the goals and 
expectations of the sponsors rather than to he a critical analysis. As 
presented, they are idealized descriptions of the tvelve treatments. These 
sponsored approaches vere included in Head Start Planned Variations "because 
they vere considered to he promising methods for vorking vith disadvantaged 
children ajid families and hecause they vere unique in some significant vay. 
Nevertheless, the sponsors i?hare common orientations. All of them seek to 
develop children's learning abilities. All are convinced of the importance 
of individual and small group instruction and. frequent inter chnage betveen 
children and concerned adults. All attempt to make learning interesting and 
relevant to the child's ciatural background. All believe that the child's 
success in learning is inseparable from his self-esteem, motivation, autonomy, 
and enrironmental support, and all attempt to promote successful development 
in these domains vhile fostering academic goals. The sponsors differ among 
themselves chiefly in the priorities vhich they assign to these objectives 
and in the sequences through vhich they pursue them. 
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It is important to recocnize that the concept of Planned Variation 
vas not intended as a means of finding a single "best" method for educating 
disadvantaged children. A wide variety of groups of children are included 
in this study, and a progrwn that is appropriate for some may not be appro- 
priate for others. Some approaches, for example, are primarily concerned 
with parental involvement and co^unity control, while others place primary 
emphasis on the curricula^, the teacher, and the classroom. The following 
paragraphs briefly attempt to capture the emphasis of each model. 
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HOC ()!»c:n I'idu cnti o n Curriculum 
Bducritlonal "Dcvclo'pmcnt Corporation (EDC) 

Sponf;or Contact ; George Hcin 

EDC has an open classroom approach derived from the 
British primary school model and theories of child develop- 
ment. It believes that " learning is facilitated by active 
participation in the process. The classroom provides a 
setting in which there is a range of materials and activities 
from which the child can choose. Academic skills are 
developed in a self-directed v/ay through . classroom experi- 
ences. The role of the teacher is one of leading the child 
to extend his own work and generally involves working with 
an individual child or small group.. 

The S yst ematic U se of Behavioral Principles Program 
(Enge'l nia n n - 3 e c >; e r ) 
University of Oregon 

Sponsor Contact : Wesley Becker 

The primary focus of the Engelmann-Becker program 
is on promoting skills and concepts essential to reading, 
arithmetic and language achievement, with particular 
emphasis on remedying language deficiencies! The main 
techniques are programmed materials, structured rapid- 
fire drills, and positive reinforcements of rewards and 
praise to encourage desired patterns of behavior. Small 
study groups of five to ten children are organized by 
* teachers according to ability levels in order to facilitate 
presentation of patterned learning materials and Lo elicit 
verbal responses from children. 
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The Pt-^nk StroGt College of Educntion Approach 
Bank SUrcoU College of Education 

Sponsor Contact : Elizabeth Gilkeson 

The Bank Street approach emphasizes both learning and . 
social-emotional development of children on the premise 
that they are intertwined. The teacher functions as a 
supportive adult whom the child can trust,, and teaches by 
relating and expanding upon each child's response to his 
experiences. The classroom is viewed as a stable environ- 
ment and workroom for the child in which he is encouraged 
to explore, make choices and carry out plans. Academic 
skills are presented in the context of classroom experiences. 

The Behavio r Analysis Approach 

Support~and Development Center for Follow-Through, University 
of Kansas 

Sponsor Contact ; Don Biishell 

The Behavior Analysis approach has three predominant 
aspects. First it emphasizes academic and .social skills. . 
Individualized programmed materials are the primary 
teaching mode. Second '.t makes systematic use of positive ■ 
reinforcement. A token exchange system is used to support 
children's learning efforts. Third it employs parents as 
members of the instructional team as well as behavior 
modifiers. They receive training and work in the cl issroom 
in shifts throughout the year. 
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Loariiiny Research and Development Center, Univ. of Pi 



The IPI approach provides an individualized program 
of instruction for each child which teaches him academic 
skills and concepts in the areas of language, perceptual 
motor mastery, classification, and reasoning. The materials 
are sequenced to reflect che natural order in which children 
acquire key skills and concepts. Diagnostic tjssts determine 
each child's strengths and weaknesses and are used by the 
teacher to prescribe instructional materials appropriate 
to his needs. Positive reinforcement, both social and 
.concrete, is given continually for success in learning. 

The Responsive Environments Corporation Model (REC ) 
Responsive EnvTronments Corporation 

Sponsor Contact : Lori Caudle 

The REC model uses specially designed, self-correcting 

multi-sensory learning materials which strengthen school 

readiness skills in language and reading. They are designed 

to teach basic concepts while allowing children to make 

choices, work independently, and set goals for themselves. 

Teaching machines in the form of "talking typewriters" 

and "talking pages" involve children in learning by seeing, 

tracing, typing, imitating and discriminating among sights 

and sounds and by recording and listening to their own 



Sponsor Contact ; Lauren Resnick 



voices . 
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The FToridcx P^irc :nt EducAtor Model 
University oi Florida 

Sponsor Contact ; Ira Gordon 

The Florida approach is not a specific classroom 
instructional ir.odel but is designed to work directly in the 
home. It focuses on the parent, believing that the parent 
is the key agent in a child's development. The major goals 
of the program are to develop educational competence in 
the child and to develop an atmosphere in the home which 
will foster continued growth. An important role is played 
by paraprof essionals called parent educators. The parent 
educator spends half-time with the teacher in the class- 
room and the other half making home visits. The home visit 
involves bringing tasks into the home and instructing the 
mother how to teach them to the child. 

The Tucson Early Education Model 
University of Arizona 

Sponsor Contact ; Ron Henderson 

The Tucson model has a flexible child-oriented 

curriculum which focuses simultaneously cn four areas of 

development: language competency, inteliectua.1 skills, 

motivational skills and societal skills. EmpJiasis is 

placed more on learning to learn skills than cn specific 

content. The content is individually determined by a child's 

environment and interests. The classroom is arranged in 

interest centers for small groups. The teacher's role is 

to work on a one-to-one basis with the child, arrange the 

Q classroom sotting and encourage interactions between the 

child, his environment and others. 0 0043 



Kosponnivc Education al Program ^^^»^^^r^4. 
!Sr-^wbsh:aboIa^^^^^ Research and Dovolopmont 

Sponoor Contact ; Glen Nimnicht 

The Responsive Educational model emphasiacc celf- 
rewardincj learning activities and a structured environment 
responsive to a child's needs and interests. The model 
encourages the child to make interrelated discoveries 
about his social world and physical environment and 
stresses the" importance of the development of a healthy 
self-concept. The classroom is a controlled environment 
in which the child is free to explore various learning 
centers, games and activities. Problem solving and concept 
formation as well as sensory and perceptual acuity are 
stressed and the pace of all learning activities is set 
by the child for himself. 

Coqnitivc l y Oriented Curriculum 
Hi/Scope Educational Foundation 

Sponsor Contact ; David Weikart 

The Cognitively Oriented Curriculum combines Piagetian 
theory and an open classroom approach. It uses a cognitively 
oriented curriculum and emphasizes the process of learning 
rather than particular subject matter. It stresses a 
child's active involvement in learning activities. The 
teacher takes an active role. Additionally, home training 
is seen as part of the program and the teacher suggests 
tasks for the mother to present to the child at home. 
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T he Knablo r Model 

Office of "child Development 

Sponsor Contact : Jenny Klein 

The Enabler Model does not involve affiliation with 

a particular instructional approach. It is build on goals 

prescribed by each community for itself. The development 

and' implefaentation of this ifiodel ^rd facilitated- by- the 

assistance of an CCD consultant who takes a very active 

role in all aspects of the program* Thus projects with 

the Enabler Model n^ay differ considerably in the approach 

and style of their educational tactics, but all share 

a comjnitment to high levels of parent participation in 

policy making, program planning and classroom operation. 
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APPENDIX II 

1. Classroom Information Form : This instrument vas used to gather in- 
formation about the "background and family characteristics of every child 
in the sample. Teachers completed the instrument hy gathering information 
from Head Start application hlanks and interviews with parents. A validity 
study of selected items from a similar form used in 197^-72 yielded en- 
couraging results (see "The Quality of the Data"). 

2. California Preschool Social Competency Scale : This is a teacher completed 
rating scale fo 30 items designed to "measure the adequacy of preschool children's 
interpersonal "behavior and the degree to vhich they assume social responsibility" 
(Levine, et al . ^ 1969, p. 3). An extensive description of the measure is 
included in "The Quality of the Planned Variation Data", Completion of the 
scale by teachers suggested to us that among classroom and among site 
comparisons would be illegitimate. The reason is simply that teachers may 
consider their o\m classrooms as the reference group for rating students. 

Since the compositions of classrooms vary greatly, the ratings may lose 
comparability when they are taken out of the immediate context of their 
classroom. 

3. Sponsor Ratinrts of Implementation : This rating form is fully described 
and analyzed in the repor-b on "Implementation". 

1*. Head Start Directors P.aljngs of Iifmlcrnentation : This form is similar to 
the Sponsor Rating except that it vas completed by the Head Start Director. 
It is discussed in the report on "Implementation". 
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5. Tea cher and Teacher Aide Survey : These forms assess teacher and teacher 
aide tackGround, teaching experiences and attitudes tovard the Planned Varia- 
tions model. They are extensively analyzed in the "Implementation" report. In 
this report, ve use some items taken from these surveys. 



instrument developed at the Stanford Reseaurch Institute to assess the degree of 
imp^enen^ation 5f classroom processes aurid child outcomes* in the varioCis programs. 
Trouble vith the coding on the classroom observation tape limi^ted our use of this 
important instrument. An analysis of some results from it are included in the 
report on "Implenentation" and an extensive analysis of its use in 19T1~T2 is 
tinder preparation by SRI. 

7. Basic Child Test Battery: Four tests are included in this "battery. The 
results from these tests are extensively analyzed in this report. Additionally, 
results from one of the tests, the Caldwell Preschool Inventory, are used in the 
report on "Cognitive Effects of Preschool Programs on Different Types of Children". 
Complete descriptions of the tests are in "The Quality of the Planned Variation 
Data". The four tests are: 

a. Caldvell Preschool Inventory . (PSI) .The PSI vas developed to assess 
general achievement in preschool in areas deemed necessary for later success in 
school. Spe:;ifically developed for preschool populations, 6k items tap areas 
of general knowledge, listening and vord meanings, listening and comprehension, 
writing, copying, quantitative skills, and speaking and labeling. Though the 
test was originally designed to have four factors, factor analyses of our data 
revealed only one factor which seemed to cut across ell areas tapped by the test. 
Consequently, vo simply sui-u^^ed the items to create a score on the test. Internal 
(l3\-20) reliability is roughly .PO. By and large, we consider this test a 



6. Classroom Observation Instrument : This is a broad range objective observation 




JC measure of general achievement in preschool. 
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The scoring procedure for the test is not normed 
for age and as a consequence, pre-scores on the 
PSI are highly and positively correlated with the 
age that the child enters the program. The PSI 
also correlates roughly 0.50 with the Stanford- 
Binet, which in turn has a slightly negative 
correlation with age'. The Stanford-BineflQ- 
score is obtained by dividing .a calculated Mental 
age by chronological age — the division by age 
makes* the IQ score comparable across ages. The 
Mental age score taken alone can be thought of as 
the Binet score uncorrected for age. Mental age 
on the Binet correlates roughly .75 with the PSI. . 
Assuming both tests have a reliability of .9, we 
find that the correlation among the "true score" 
parts of the PSI and the Binet score unadjusted 
for age is roughly .83*. Though this correlation 
is far from perfect, it suggests that the two tests 
are tapping somewhat the same domain. 

b. NYU Book 3D . The NYU booklets were designed to me as 
areas of specific preschool achievement. Book 3D 



♦The samole used for these estimates and other estimates on 
following pages of this chapter was th^- same sample used for 
the correlation matrix on Page 120 of The Quality of the 
Planned Variation Data" for estimates of the reliabilities 
of the two tests. 
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is dcs5 .'Xsd to tap achievement in pre-math (seven 
items) / 1 j.-c-science (seven items) , and linguistic 
concepts (five items assessing knowledge of pre- . 
positions) . Both NYU booklets (3D and 4A) were 
extracted by SRI from the Early Childliood Inventories 
developed by A. Collier and J. Victor at the Institute 
for Developmental Studies at the NYU School of Educa- 
tion. Two scoring systems are used in the analyses 
in this report. First, a simple summary score 
obtained by adding together all correctly answered 
items is used. A factor analysis of the Book 3D 
suggested that there was only one stable, interpretable 
factor.* Estimates of internal reliability for the 
total score are generally in the range of 0.60-0.70. 
In this report we use 0.65 as a reliability estimate 
for individual scores. Moreover, the single score 
seems to have a ceiling problem for some groups of 
older children on the post-test results. See "The 
Quality of the Planned Variation Data" for discussions 
of these issues. Second, a set of scores is obtained 
by considering the three sub-tests as criterion- 
referenced measures. Using these measures, we 
report the percentages of children in various sites 

*A factor analysis of Books 3D and 4A together convinced 
us to keep the tests separated for analytic purposes. 
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and models for each sub-test v;ho obtain either 
a perfect score or only one item incorrect at post- 
test time. We also report the percentages of 
children in these groups who fail to get more than 
one item correct on each sub-test. 

A score derived from a summing of correct 
items for Book 3D bears a very strong relationship 
to the PSI. By and large, different sub-samples of 
the data reveal correlations of about 0.70 at pre- 
test time, ^ 



Adjustment of this 
correlation for the reliabilities of the two tests 



bility is roughly 0.6 5.^ yields a corrected correla- 
tion coefficient of roughly 0.9 5 indicating that the 
two tests are tapping almost entirely the same domain 

NYU Book 4A> This test is designed to tap achieve- 
ment in three areas: knowledge of alphabet names 
(nine items) ; knov;ledge' of numeral names (six items) ; 
and knowledge of shape names (three items). The 
development of scores for this test was similar to 
the development of scores for the Book 3D. A 
single summary score is analy7-ed in this report 
along with three criterion-referenced measures. With 
the exception of the third sub-test we follow the 




(PSI reliability is roughly 0.90 and Book 3D relia- 
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same rules for creating our criteria, as we did 
for Book 3D. In the third sub-test, we required 
that the student answer all three questions correct- 
ly to nieet the criterion. The single score on 
Book 4A has an internal reliability of 
roughly 0.65 for the pre-test. To some extent 
this reliability is reduced by a nvisior floor prob- . . . 
1cm in the Fall testing. For all groups the Book 4A 
scores were positively skewed in the Fall and more 
normally distributed in the Spring. Pre-test scores 
for Book 4A and the PSI correlate roughly 0.45-0.50, 
with the Book 3D the correlations are roughly 0.40- 
0.45 and with the Stanf ord-Binet, the correlations 
are roughly 0.40. Overall, then, though the Book 4A 
is assessing somewhat similar areas as the PSI, 
Book 3D and the Stanford-Binet, there is considerable res 
al unique varriance associated witix the test. 

Mntor Inhibition Test . This test was developed by 
Hagen and Degerman (see Maccoby et al., 1965) to measure 
a child's ability to inhibit movement when the task 
demands it. Three tasks are used to assess inlii- 
bition, the Draw a Line slowly task, the Walk sla^y 
task, and the Pull Truck slowly cask. Four pre- 
liminary items assess the child's understanding of 
the concepts of slow and fast. A substantial proper- 
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tion of the sample of children in this study 
(over 50%) failed to ansv/er two or more of the , 
four pre-test itenis correctly, in either the Fall 
or Spring, indicating that these children did not 
understand the two concepts. The scores on 'the 
Motor Inhibition test were not analyzed for these 
children. Analyses of the"* three sub-'tests ' indicate'd 
that the first two tasks yielded scores that cor- 
related roughly 0.46. Correlations of the first 
two tasks with the third task were roughly 0.24. 
The low correlations v;ith the third sub-task indi- 
cated to us that it was either unreliable or was 
measuring something other than the other two. svb- 
tasks. Consequently, we formed a measure of the 
Motor Inhibition by summing the amounts of time in 
seconds taken to complete the first two s\±>-tasks. 
Following Maccoby's lead and an inspection of the 
dato/ the log of this score was then taken. The log 
•transformation removed the strong positive skewness 
from the new scores. This final score correlates 
in the 0.30 to 0.40 range witli the NYU 3D and PSI 
and in the 0.15-0.20 range with the Book 4A and 
the Stanford-Binet. 



8. Hthnic HoriUioc Tos t: Two tests were actually used 

* here. The Ethnic Identity Questionnaire (EIQ) was developed 
by Manuel Ramirci: III at the University of California, 
Riverside, to investigate the ethnic identity of Mexican- 
American children and the Children's Cultural Awareness 
Scale (CCAS) was developed by Edward J. Barnes at the 
University of Pittsburgh to explore the cultural awareness 
... .of ina^:k ghildren in the Head Start Planned Variation Study. 
Scores from neither test are used in this report. 

• 

9. Stanford-Binet ; The Stanford-Binet Intelligence 
scale is a well-known measure of "general intelligence". 
The 1960 revision was used 'in this study. A single 
measure of IQ is used in this report. After extensive 
checking for matched pre- and post- birthdates and valid 
items, the score was calculated by dividing a child's 
m\tal Age derived from the test 'by his chronological age 
in months and then corrected for age-related fluctuations 
in variance using the revised Pinneau tables (see Terman 
and Merrill, 1960) . . ' 

« 

10. 8-niock Sort Task Test ; The Eight Block Sort Task 
is a measure of maternal teaching style and interaction 
styles between mother and child. The score used in this 
. report ranges from 0-8 points and indicates the success 
of the mother in teaching the sorting tasks to the child. 
(See "The Quality of the Planned Variations Data" for an 
ERIC extcnr.ivo discunsion of thic measure.) 
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